Add to_tensor support for VideoEncoder #957

Dan-Flores · 2025-10-14T03:34:58Z

This PR enables encoding a video to a tensor by adding a VideoEncoder constructor that accepts an AVIOContextHolder and a file format. The encoder is instantiated with an AVIOToTensorContext.

Testing:

test_video_encoder_round_trip is updated to test the to_tensor method.
The changes in AVIOTensorContext.cpp enable certain formats, such as avi and flv, to be encoded correctly.
- Although we do not test these container formats in test_video_encoder_round_trip, without the changes, they would raise an error when attempting to decode the encoded frames, indicating an incorrectly encoded video.
- The added max field allows encoders that write the header last (doing a backwards seek) to output a correctly encoded tensor.
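
To illustrate why tracking the maximum written position matters, here is a minimal Python sketch of a seekable in-memory sink. It is not the code in AVIOTensorContext.cpp: the class `GrowableWriteBuffer` and its methods are hypothetical names, and a `bytearray` stands in for the output tensor. It only shows that, after a backwards seek to patch the header, the current position no longer equals the size of the encoded data.

```python
class GrowableWriteBuffer:
    """Hypothetical in-memory sink, for illustration only."""

    def __init__(self) -> None:
        self._data = bytearray()
        self._pos = 0      # current write position
        self._max_pos = 0  # furthest byte ever written

    def write(self, chunk: bytes) -> int:
        end = self._pos + len(chunk)
        if end > len(self._data):
            # Grow the backing storage so the chunk fits.
            self._data.extend(b"\x00" * (end - len(self._data)))
        self._data[self._pos:end] = chunk
        self._pos = end
        # Remember the furthest position, even if we seek backwards later.
        self._max_pos = max(self._max_pos, self._pos)
        return len(chunk)

    def seek(self, offset: int) -> int:
        if offset < 0:
            raise ValueError("offset must be non-negative")
        self._pos = offset
        return self._pos

    def contents_using_pos(self) -> bytes:
        # Truncating at the current position loses data after a backwards seek.
        return bytes(self._data[: self._pos])

    def contents_using_max(self) -> bytes:
        # Truncating at the maximum position keeps everything that was written.
        return bytes(self._data[: self._max_pos])


buf = GrowableWriteBuffer()
buf.write(b"????")        # placeholder header
buf.write(b"frame-data")  # payload
buf.seek(0)               # muxer goes back to the start...
buf.write(b"HDR!")        # ...and writes the real header last
```

In this sketch, `contents_using_pos()` returns only the four header bytes, while `contents_using_max()` returns the header followed by the payload.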

NicolasHug

Thanks for the PR @Dan-Flores, nice work! Left a few comments and suggestions, but this looks great.

NicolasHug · 2025-10-15T11:17:08Z

test/test_ops.py

    create_from_tensor,
    encode_audio_to_file,
    encode_video_to_file,
+    encode_video_to_tensor,


Let's add some more tests:

- let's also parametrize test_bad_input over the method, when relevant
- let's have a similar test to test_against_to_file (torchcodec/test/test_encoders.py, line 345 in b084768):

    def test_against_to_file(

These tests can be easier to write (and read!) when we have the public Python VideoEncoder class. So if you prefer this can be left as a TODO, but we should make sure to follow up on that.

Dan-Flores · 2025-10-15

I'll add a similar test to this PR, hopefully it is fairly readable.
I may move some tests to test_encoders.py once I push a PR with the Python API, since we really want to test the VideoEncoder entrypoint.

Dan-Flores · 2025-10-15

Parametrizing test_bad_input without a VideoEncoder class is fairly messy. I'll move this test to test_encoders.py and clean it up. For now, I've added the case where a bad format is passed in.

NicolasHug · 2025-10-16T09:42:26Z

That sounds good, maybe we can just write a TODO to parametrize test_bad_input when we move it to test_encoders.py?

NicolasHug · 2025-10-16T09:40:27Z

test/test_ops.py

            assert_close(s_frame, rt_frame, atol=atol, rtol=0)

+    @pytest.mark.parametrize(
+        "format", ("mov", "mp4", "avi", "mkv", "webm", "flv", "gif")


Let's mark the webm entry as slow (if it is!), see #968 (comment) for how to do that just for the webm entry, and not to the whole test
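
For readers unfamiliar with per-parameter marks, a minimal sketch of the idea follows. It uses `pytest.param` with `marks=`, which is standard pytest. The tuple `FORMATS` and the placeholder test are hypothetical and are not the code added in this PR. The sketch also assumes a `slow` marker is registered in the project's pytest configuration.

```python
import pytest

# Hypothetical format list mirroring the parametrization in the diff above.
# Only the "webm" entry carries the slow mark; the other entries are unmarked.
FORMATS = (
    "mov",
    "mp4",
    "avi",
    "mkv",
    pytest.param("webm", marks=pytest.mark.slow),
    "flv",
    "gif",
)


@pytest.mark.parametrize("format", FORMATS)
def test_round_trip_placeholder(format):
    # Placeholder body; the real test encodes and decodes a video.
    assert isinstance(format, str)
```

With this layout, running `pytest -m "not slow"` deselects only the webm case and still runs the other formats.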

Dan-Flores · 2025-10-17

Thanks for the pointer, I've added the mark. The webm tests are in fact very slow.

I suspect it's because the encoding is done in GBR instead of YUV, since avcodec_find_best_pix_fmt_of_list chooses the least lossy format and the webm encoder supports GBR. We will probably want to change this for efficiency.

    pytest --durations=25 -k "TestVideoEncoderOps"
    ==================== slowest 25 durations ====================
    25.98s call test/test_ops.py::TestVideoEncoderOps::test_against_to_file[webm]
    18.30s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_against_ffmpeg_cli[webm]
    13.53s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_round_trip[to_tensor-webm]
    13.33s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_round_trip[to_file-webm]
    3.67s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_against_ffmpeg_cli[gif]
    3.65s call test/test_ops.py::TestVideoEncoderOps::test_against_to_file[gif]
    3.03s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_against_ffmpeg_cli[mkv]
    2.78s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_against_ffmpeg_cli[mov]
    2.71s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_against_ffmpeg_cli[mp4]
    2.35s call test/test_ops.py::TestVideoEncoderOps::test_against_to_file[mkv]
    2.26s call test/test_ops.py::TestVideoEncoderOps::test_against_to_file[mp4]
    2.21s call test/test_ops.py::TestVideoEncoderOps::test_against_to_file[mov]
    2.19s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_round_trip[to_tensor-mp4]
    2.09s call test/test_ops.py::TestVideoEncoderOps::test_video_encoder_round_trip[to_file-mov]

NicolasHug

Thanks @Dan-Flores! Left 3 non-blocking comments (one above, two below).

LGTM

NicolasHug · 2025-10-16T09:41:14Z

test/test_ops.py

+        torch.testing.assert_close(
+            self.decode(encoded_file).data, self.decode(encoded_tensor).data
+        )


These are uint8 so the default atol will be 0 anyway, but it's good to be explicit:

Suggested change

-        torch.testing.assert_close(
-            self.decode(encoded_file).data, self.decode(encoded_tensor).data
-        )
+        torch.testing.assert_close(
+            self.decode(encoded_file).data, self.decode(encoded_tensor).data, rtol=0, atol=0
+        )
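
As a small standalone check of the point about tolerances, the sketch below compares uint8 tensors with `torch.testing.assert_close`, once with the defaults and once with explicit `rtol=0, atol=0`. The tensors and the helper `differs` are made up for illustration and do not come from the PR. Decoded frames are assumed to be uint8, as stated in the comment above.

```python
import torch

# Two uint8 tensors that differ by one in a single element.
a = torch.tensor([10, 20, 30], dtype=torch.uint8)
b = torch.tensor([10, 20, 31], dtype=torch.uint8)

# Identical tensors pass with explicit zero tolerances.
torch.testing.assert_close(a, a.clone(), rtol=0, atol=0)


def differs(x: torch.Tensor, y: torch.Tensor, **kwargs) -> bool:
    """Hypothetical helper: True if assert_close rejects the pair."""
    try:
        torch.testing.assert_close(x, y, **kwargs)
    except AssertionError:
        return True
    return False


# For integer dtypes the default tolerances are zero, so both calls
# reject an off-by-one difference.
off_by_one_default = differs(a, b)
off_by_one_explicit = differs(a, b, rtol=0, atol=0)
```

Passing the tolerances explicitly does not change the outcome here. It mainly documents that the comparison is meant to be exact.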


meta-cla bot added the CLA Signed label Oct 14, 2025

Dan-Flores commented on test/test_ops.py Oct 14, 2025 (outdated)

Dan-Flores commented on src/torchcodec/_core/Encoder.cpp Oct 14, 2025

Dan-Flores marked this pull request as ready for review October 14, 2025 19:51

NicolasHug reviewed Oct 15, 2025

NicolasHug referenced this pull request Oct 15, 2025

to_filelike, update test

3e97d89

NicolasHug reviewed Oct 15, 2025

NicolasHug reviewed Oct 16, 2025

NicolasHug approved these changes Oct 16, 2025

Daniel Flores added 8 commits October 16, 2025 14:27

- to_tensor, AVIOTensorContext fix (7408430)
- update tensorContext vars, use std::max (62ad0e3)
- rename output_method, move mkv handling to c++ (9afcb71)
- set reusable params for parametrized round trip test (bf779d5)
- simplify decode method by using VideoDecoder (c7f2c33)
- add test_against_to_file (e2dac79)
- test bad format (540d423)
- incorporate suggestions (231c68f)

Dan-Flores force-pushed the video_to_tensor branch from 80d3999 to 231c68f Compare October 16, 2025 20:22

Dan-Flores merged commit 2117716 into meta-pytorch:main Oct 17, 2025
58 checks passed

Dan-Flores deleted the video_to_tensor branch October 17, 2025 13:30

NicolasHug pushed a commit to NicolasHug/torchcodec that referenced this pull request Oct 27, 2025

Add to_tensor support for VideoEncoder (meta-pytorch#957)

f39ebfd

Co-authored-by: Daniel Flores <[email protected]>
